CAE-Net: Cross-Modal Attention Enhancement Network for RGB-T Salient Object Detection
نویسندگان
چکیده
RGB salient object detection (SOD) performs poorly in low-contrast and complex background scenes. Fortunately, the thermal infrared image can capture heat distribution of scenes as complementary information to image, so RGB-T SOD has recently attracted more attention. Many researchers have committed accelerating development SOD, but some problems still remain be solved. For example, defective sample interfering contained or hinder model from learning proper saliency features, meanwhile low-level features with noisy result incomplete objects false positive detection. To solve these problems, we design a cross-modal attention enhancement network (CAE-Net). First, concretely fusion (CMF) module fuse where cross-attention unit (CAU) is employed enhance two modal channel used dynamically weigh features. Then, joint-modality decoder (JMD) cross-level are purified by higher level multi-scale sufficiently integrated. Besides, add single-modality (SMD) branches preserve modality-specific information. Finally, employ multi-stream (MSF) three decoders’ Comprehensive experiments conducted on datasets, results show that our CAE-Net comparable other methods.
منابع مشابه
RGB-D Salient Object Detection Based on Discriminative Cross-modal Transfer Learning
In this work, we propose to utilize Convolutional Neural Networks (CNNs) to boost the performance of depth-induced salient object detection by capturing the high-level representative features for depth modality. We formulate the depth-induced saliency detection as a CNN-based cross-modal transfer problem to bridge the gap between the " data-hungry " nature of CNNs and the unavailability of suff...
متن کاملAgile Amulet: Real-Time Salient Object Detection with Contextual Attention
This paper proposes an Agile Aggregating Multi-Level feaTure framework (Agile Amulet) for salient object detection. The Agile Amulet builds on previous works to predict saliency maps using multi-level convolutional features. Compared to previous works, Agile Amulet employs some key innovations to improve training and testing speed while also increase prediction accuracy. More specifically, we f...
متن کاملMSDNN: Multi-Scale Deep Neural Network for Salient Object Detection
Salient object detection is a fundamental problem and has been received a great deal of attentions in computer vision. Recently deep learning model became a powerful tool for image feature extraction. In this paper, we propose a multi-scale deep neural network (MSDNN) for salient object detection. The proposed model first extracts global high-level features and context information over the whol...
متن کاملLocal Background Enclosure for RGB-D Salient Object Detection - Supplementary Results
The purpose of this supplementary material is to examine in detail the contributions of our proposed Local Background Enclosure (LBE) feature. A comparison of LBE with the contrast based depth features used in state-of-the-art salient object detection systems is presented. The LBE feature is compared with the raw depth features ACSD [1], DC [3] and a signed version of DC denoted SDC on the RGBD...
متن کاملNeural substrates of perceptual enhancement by cross-modal spatial attention.
Orienting attention involuntarily to the location of a sudden sound improves perception of subsequent visual stimuli that appear nearby. The neural substrates of this cross-modal attention effect were investigated by recording event-related potentials to the visual stimuli using a dense electrode array and localizing their brain sources through inverse dipole modeling. A spatially nonpredictive...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Electronics
سال: 2023
ISSN: ['2079-9292']
DOI: https://doi.org/10.3390/electronics12040953